Genome-wide prediction of minor-groove electrostatic potential enables biophysical modeling of protein–DNA binding
نویسندگان
چکیده
Protein-DNA binding is a fundamental component of gene regulatory processes, but it is still not completely understood how proteins recognize their target sites in the genome. Besides hydrogen bonding in the major groove (base readout), proteins recognize minor-groove geometry using positively charged amino acids (shape readout). The underlying mechanism of DNA shape readout involves the correlation between minor-groove width and electrostatic potential (EP). To probe this biophysical effect directly, rather than using minor-groove width as an indirect measure for shape readout, we developed a methodology, DNAphi, for predicting EP in the minor groove and confirmed the direct role of EP in protein-DNA binding using massive sequencing data. The DNAphi method uses a sliding-window approach to mine results from non-linear Poisson-Boltzmann (NLPB) calculations on DNA structures derived from all-atom Monte Carlo simulations. We validated this approach, which only requires nucleotide sequence as input, based on direct comparison with NLPB calculations for available crystal structures. Using statistical machine-learning approaches, we showed that adding EP as a biophysical feature can improve the predictive power of quantitative binding specificity models across 27 transcription factor families. High-throughput prediction of EP offers a novel way to integrate biophysical and genomic studies of protein-DNA binding.
منابع مشابه
A DNA minor groove electronegative potential genome map based on photo-chemical probing
The double-stranded DNA of the genome contains both sequence information directly relating to the protein and RNA coding as well as functional and structural information relating to protein recognition. Only recently is the importance of DNA shape in this recognition process being fully appreciated, and it also appears that minor groove electronegative potential may contribute significantly in ...
متن کاملA map of minor groove shape and electrostatic potential from hydroxyl radical cleavage patterns of DNA.
DNA shape variation and the associated variation in minor groove electrostatic potential are widely exploited by proteins for DNA recognition. Here we show that the hydroxyl radical cleavage pattern is a quantitative measure of DNA backbone solvent accessibility, minor groove width, and minor groove electrostatic potential, at single nucleotide resolution. We introduce maps of DNA shape and ele...
متن کاملCharacterization of PicoGreen interaction with dsDNA and the origin of its fluorescence enhancement upon binding.
PicoGreen is a fluorescent probe that binds dsDNA and forms a highly luminescent complex when compared to the free dye in solution. This unique probe is widely used in DNA quantitation assays but has limited application in biophysical analysis of DNA and DNA-protein systems due to limited knowledge pertaining to its physical properties and characteristics of DNA binding. Here we have investigat...
متن کاملElectrostatic free energy landscapes for DNA helix bending.
Nucleic acids are highly charged polyanionic molecules; thus, the ionic conditions are crucial for nucleic acid structural changes such as bending. We use the tightly bound ion theory, which explicitly accounts for the correlation and ensemble effects for counterions, to calculate the electrostatic free energy landscapes for DNA helix bending. The electrostatic free energy landscapes show that ...
متن کاملMethylene blue binding to DNA with alternating AT base sequence: minor groove binding is favored over intercalation.
The results presented in this paper on methylene blue (MB) binding to DNA with AT alternating base sequence complement the data obtained in two former modeling studies of MB binding to GC alternating DNA. In the light of the large amount of experimental data for both systems, this theoretical study is focused on a detailed energetic analysis and comparison in order to understand their different...
متن کامل